Quantifying collagen structure in breast biopsies 
using second-harmonic generation imaging 
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Abstract: Quantitative second-harmonic generation imaging is employed to 
assess stromal collagen in normal, hyperplastic, dysplastic, and malignant 
breast tissues. The cellular scale organization is quantified using Fourier 
transform-second harmonic generation imaging (FT-SHG), while the 
molecular scale organization is quantified using polarization-resolved 
second-harmonic generation measurements (P-SHG). In the case of FT- 
SHG, we apply a parameter that quantifies the regularity in collagen fiber 
orientation and find that malignant tissue contains locally aligned fibers 
compared to other tissue conditions. Alternatively, using P-SHG we 
calculate the ratio of tensor elements (dis/dn, d^dn, and d 33 ld 3 i) of the 
second-order susceptibility % for collagen fibers in breast biopsies. In 
particular, d [5 ld 3l shows potential differences across the tissue pathology. 
We also find that trigonal symmetry (3m) is a more appropriate model to 
describe collagen fibers in malignant tissues as opposed to the 
conventionally used hexagonal symmetry (C6). This novel method of 
targeting collagen fibers using a combination of two quantitative SHG 
techniques, FT-SHG and P-SHG, holds promise for breast tissue analysis 
and applications to characterizing cancer in a manner that is compatible 
with clinical practice. 
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1. Introduction 

Breast cancer is both a leading cause of cancer deaths and among the most frequently 
diagnosed cancers in women worldwide [1]. In 2012, it is estimated that -227,000 new 
patients will be newly diagnosed and -40,000 will die from breast cancer in the US alone [2]. 
In humans, almost all breast cancers arise from an uncontrolled growth of aberrant epithelial 
cells that originate either in the milk ducts or lobules and are categorized as ductal or lobular 
carcinoma, respectively. Excessive growth within human tissues can result in dense cellular 
masses ranging from hyperplasia to malignant tumors [3,4]. Though there are more cells in 
hyperplastic tissue, cellular structure and tissue morphology is similar to normal tissue. In 
dysplasia, both the cells and their organization are abnormal while neoplasia can be benign, 
potentially malignant (pre-cancer), or malignant (cancer). Most diagnostic and mechanistic 
studies in breast cancers focus on epithelial cells and significantly less work has been reported 
on the role of surrounding stroma and its extracellular matrix (ECM). Stromal involvement 
and its use as a diagnostic tool, however, is a rapidly emerging concept and is of keen 
contemporary interest [5-9]. A number of molecular and imaging techniques have been used 
to characterize the stroma [10-13]. In particular, the ECM predominantly contains type I 
fibrillar collagen [14], which has been recognized as being important for the initiation and 
progression of breast cancer [15-17]. Despite these promising results, the lack of analytical 
techniques to image the ECM is a major barrier to quantifying stromal role in breast cancer 
progression and, potentially, to a diagnostic protocol. 

A useful tool for investigating collagen fiber organization is second-harmonic generation 
(SHG) imaging, which has recently gained popularity for imaging collagen fibers in 
biological tissues [18-20]. SHG is a second-order, nonlinear optical process which originates 
from structures possessing non-centrosymmetry (i.e., those with no center of inversion 
symmetry) and a high second-order nonlinear coefficient, such as collagen fibers [21,22]. 
That is, when intense incident light interacts with the collagen fibers, it generates light at 
twice the incident frequency or half the incident wavelength. Since SHG is emitted mainly 
from collagen fibers, high-contrast images of collagen-based tissues can be obtained when 
used in an imaging modality. Moreover, since the intensity of the emitted second-harmonic 
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signal varies as the square of the incident light intensity, the excitation can be confined to a 
sub-femtoliter focal volume, thereby making it feasible to obtain 3D images. 

There has been much recent effort in developing useful metrics for quantifying collagen 
fiber organization utilizing SHG microscopy [23-25]. Quantification facilitates extraction of 
sensitive changes in collagen fiber organization due to damage or disease, and, thus, may help 
in early diagnosis. Studies are needed, however, that establish this approach as a clinically- 
compatible technique with a specific protocol for diagnostic use. Hence, the potential of SHG 
to understand and provide diagnostic information as well as technology development for this 
use are needed. As a first step towards that goal, here we first investigate the relative 
effectiveness of two techniques, Fourier transform-second harmonic generation imaging (FT- 
SHG) and polarization-resolved second-harmonic generation (P-SHG). FT-SHG involves 
utilizing 2D spatial Fourier transforms of SHG images to quantify collagen fiber organization 
at cellular scales (300 nm-100 (xm) [26-29]. Metrics such as preferred fiber orientation have 
previously been used to successfully assess tendon injury as well as to explore age-related 
structural changes in cortical bone [26,29]. P-SHG exploits the coherent signal generation by 
modulating the incident light polarization to obtain normalized tensor elements of the second- 
order susceptibility coefficient (x) of collagen fibers [21,23,24,30-33]. Assuming that type I 
collagen fibers possess cylindrical symmetry (C<5) crystal class, it has previously been shown 
that x can be used to differentiate between type I and type II collagen, muscle and type I 
collagen, and also to determine the orientation of molecules in collagen-based tissues 
[23,24,34].Therefore, seems to be sensitive for detecting structural changes at the 
molecular scale. 

P-SHG has been used previously in two reports for breast cancer detection, however, with 
mixed results [35,36]. In one study, the researchers used j 2 to estimate the orientation of 
collagen molecules in normal and cancerous tissues. Unfortunately, to their surprise, they did 
not observe any statistical differences between the two [35]. In the same year, another group 
reported the value of a particular matrix element in j 2 , i.e., d 22 for normal and breast cancer 
tissues. d 22 was normalized with respect to a nonlinear crystal (lithium niobate) and they 
found significant differences between the two [36]. Here, we sought to examine the potential 
of SHG imaging for analysis of breast tissue by building on these studies in several significant 
ways. As a prelude to eventual biomedical research and clinical translation, an overarching 
consideration in our study is the compatibility with current tissue examination protocols. 
While the developed methods are sought to be generally applicable to cancer research, we 
consider the aforementioned four breast tissue conditions as a relevant test case. Furthermore, 
we also assume a more general crystal class symmetry for collagen, i.e., trigonal symmetry 
(3m), and calculate all the (normalized) tensor elements for the various conditions. Finally, we 
also determine the unique combination of normalized tensor elements that are most sensitive 
for characterizing the four tissue conditions. This paper is organized as follows: we first 
discuss the methodologies of FT-SHG (section 2.1) and P-SHG (section 2.2). Section 3 
describes the sample preparation and experimental setup of the SHG microscope. In section 4, 
we discuss the results and discussion from the two techniques followed by the conclusion in 
section 5. 

2. Methodology 

2.1. FT-SHG 

Fourier transform-second harmonic generation imaging (FT-SHG) utilizes spatial Fourier 
analysis on SHG images and can be used to quantify collagen fiber (spatial) organization in 
biological tissues [26-29]. In this work, the relative, preferred, collagen fiber orientation is 
considered as a parameter for quantification. To calculate orientation, we employ a custom 
interactive graphical user interface program in Matlab (Math Works). Specifically, in our 
approach, the 2D Fourier transform of a region of interest in an image is calculated, and an 
angular power spectrum (strength of the various orientations within a particular region) is 
obtained. This plot is then fitted with a Gaussian distribution, the center of which gives the 
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preferred orientation of fibers (multiple fits for several orientations) with respect to a 
horizontal axis (chosen to be the long axis of the rectangular glass slide used), while the width 
of the distribution provides a measure of the randomness in fiber orientation. Further, we 
divide the image into smaller "sub-images", and determine the local fiber orientation within 
each of these smaller regions. 

We define the parameter A.I. ratio as the number of anisotropic to isotropic regions. 
Regions with fibers having a preferential orientation are categorized as anisotropic, while 
those with fibers aligned along several different orientations (with no preferred orientation) 
are considered isotropic. To decide whether a region is isotropic or anisotropic, we look at the 
angular power spectrum. For example, a single sharp peak in the angular power spectrum 
corresponds to a region containing fibers that are all oriented along the same direction, while 
multiple peaks in the power spectrum correspond to several unique directions that fibers are 
oriented along. Thus, A.I. ratio gives a measure of the number of regions with highly aligned 
fibers in comparison to those having randomly oriented fibers. Additional details on the 
methodology behind FT-SHG can be found in Refs. [26-29]. 

2.2. P-SHG 

In general, the induced polarization density (P) in a material is given by 

P = Z ^E + Z ^E 2 +Z ^E 3 + ..., (1) 

where E is the incident electric field and x <n> is tne « th -order nonlinear susceptibility. Since 
SHG is a second-order process, we consider only the second term. Thus, for an incident field 
with an angular frequency co interacting with a non-centrosymmetric material, the induced 
second-order polarization density P at frequency 2m, can be written as [37,38] 

P i {2a>) = 2j j d ijk E j {a>)E k {a>), i,j,k = 1,2,3, (2) 

where d^ is a 2nd-rank tensor corresponding to the 2nd-order nonlinear susceptibility, and 
Ej(co), E k (co), and Pfico) correspond to the components of the electric field and polarization 
density along the three principal axes of the material, respectively. Fortunately, assuming 
symmetry conditions such as intrinsic permutation, and the fact that P is Hermitian, the d 
tensor, can be represented as a 3x6 matrix, and therefore Eq. (2) can now be written as [38] 



P x (2a>) 
P„(2a>) 



As previously mentioned, we assume collagen fibers to possess the more general crystal class 
trigonal symmetry (3 in) [39]. Hence, the d matrix is given by [38] 
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where d 15 , d 2 2, d 31 , and d 33 are the only non-zero elements. Therefore, the components of the 
polarization density are given as 
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P x (2co) = 2d ]5 E x (co)E z {co)-2d 22 E y {<o)E x (co) 

P y (2a>) = d 22 {-E x 2 (co) + E; (co)) + 2d l5 E z (co)E v (co) (5) 

P (2a) = d 31 [E; (co) + E; (co)) + d 33 E 2 2 (co) . 

Figure 1 depicts the geometry that we consider for a single collagen fiber relative to an 
applied optical field. A collagen fiber is oriented along the z-axis while electric field E is 
incident normal to the y-z plane (along the x-axis), and a is the angle between the incident 
polarization and the z-axis such that E y (co) = E sin a, and E z (co) = E cos a. Under relatively 
weak focusing conditions the contribution of the longitudinal field component is much less 
than that of the transverse components and E x (co) is ~0. We obtain the emitted SHG intensity 
by taking the sum of the squares of the components of P, and thus obtain 



I(2co) 



(d 22 sin 2 a + d l5 sin 2a) +(<^ 3 i sin 2 a + d 33 cos 2 a) 



(6) 



From this equation we observe that the values of the tensor coefficients can be determined 
to within a constant by fitting the experimental plot of SHG intensity as a function of input 
polarization (polarization spectrum). Figure 2(a) shows three typical normalized plots of the 
polarization spectrum that can be obtained from Eq. (6) assuming appropriate values of tensor 
elements, i.e., d 15 = 1.0, d 33 = 1.6, d 3X = 0.8, and d 22 = 0.21. Panels (i) and (ii) in Fig. 2(a) 
shows polarization spectra for nonzero values of d 22 (-0.21 for (i) and 0.21 for (ii)), which 
exhibit noticeable asymmetry about 90°. Larger positive values of d 22 produces the pattern in 
(i), while larger negative values produces that shown in (ii); values of d 22 close to zero yield 
the typical spectral shape shown in (iii). Note that if collagen fiber is assumed to possess 
cylindrical crystal class (C6) symmetry, which is typically assumed, we obtain the spectrum 
shown in Fig. 2(b), which is the same as that of panel (iii) in 2a. It is clear then that class 3m 
is a more general symmetry class of which cylindrical symmetry is a special case. 



INPUT PUMP 
FIELD E 




SHG SIGNAL 
AFTER FILTERING 



Fig. 1 . Illustration of the geometric arrangement of a single collagen fiber relative to an applied 
electric field. The emitted SHG signal after spectral filtering is shown in blue. Details are 
provided in the text. 



3. Experiment 

3.1. Sample preparation 

To develop any imaging modality relevant to cancer, a large sampling diversity and number 
must be incorporated to avoid statistical pitfalls [40]. Tissue microarrays (TMAs) consist of 
representative tissue materials from a large number of patients that is arranged in a grid 
format and represent a convenient platform for developing imaging modalities for diagnostic 
purposes [41,42]. A breast tissue microarray (US Biomax BR1003) was obtained from 
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Fig. 2. Simulated plots of normalized SHG intensity as a function of polarization angle 
(polarization spectrum) for collagen fibers possessing a) trigonal 3m and b) cylindrical C6 
crystal class symmetry. 

formalin-fixed, paraffin-embedded tissue and contained 1-mm diameter circular cores of 
tissues for different histologic classes — normal, hyperplastic, dysplastic, and malignant (both 
ductal and lobular carcinoma). These samples were H&E stained and mounted in xylene 
based mounting medium. The sample thickness is ~5 (im, which is thin enough to produce 
high-resolution forward SHG images. It is notable that, unlike vibrational spectroscopic 
imaging techniques that need the sample to be unstained, the sample preparation protocol is 
entirely compatible with clinical processing. 

3.2. Experimental setup 

The schematic of the experimental setup for SHG imaging is shown in Fig. 3. An 1X81 
Olympus microscope is modified to incorporate both backward and forward collection 
geometries. The light source is a tunable Ti:sapphire laser (Spectra-Physics Mai-Tai HP 
DeepSee) that produces 100 femtosecond-duration pulses at 80-MHz repetition rate. The 
beam is linearly polarized and spectrally centered at 800 nm. It is then spatially filtered and 
collimated before it is scanned by a pair of galvo-mirrors (Cambridge Technology). The 
polarization incident on the sample is controlled by a polarizer and half-wave plate. The beam 
is then directed through a combination of relay lenses (scan and tube lens), reflected by a 
custom-made gold mirror (Lattice Electro Optics Au-B-l"xl"xlmmT) to preserve the 
polarization state. The polarization ratio at the sample plane is > 600:1. The beam is 
subsequently focused onto the sample using a 0.6 NA long working distance air objective 
(Olympus 40X LUCPLFLN) resulting in a focal spot size of -500 nm. The transmitted signal 





Class C6 




30 60 90 120 150 180 
Polarization ( ° ) 



#168709 - $15.00 USD Received 21 May 2012; revised 19 Jul 2012; accepted 4 Aug 2012; published 7 Aug 2012 
(C) 2012 OSA 1 September 2012 / Vol. 3, No. 9 / BIOMEDICAL OPTICS EXPRESS 2027 



from the sample is collected by a 0.66 NA air objective (Reichert 40X Plan Achromat). The 
SHG signal is filtered through a laser blocking filter (Semrock FF01-680/SP-25) and a 390/20 
nm bandpass filter (Semrock FF01-390/18-25), and collected by an EMCCD camera 
(Hamamatsu C9100-13). An average power of < 10 mW is used for all experiments. 

For P-SHG measurements, the forward-scattered SHG intensity is recorded as the incident 
polarization is varied between 0° to 180°. We find it sufficient to choose a step size of 30° to 
produce accurate measures of the d ratios. We chose to analyze 11 cores of normal, 10 cores 
of hyperplastic, 10 cores of dysplastic, and 11 cores of malignant tissue. From each core, 
regions (~5 \xm x 5 |j.m) containing a collagen fiber were randomly selected for analysis from 
one of five sectors (top, left, bottom, right, and center). Note that we take discrete 
measurements within a core as opposed to imaging the core pixel-by-pixel. In total, the 
respective sample size for normal, hyperplastic, dysplastic, and malignant tissue are 48, 48, 
42, and 46. All images are acquired from the surface of the sample to avoid birefringence and 
diattenuation (at large depths) [43,44], which may further complicate the P-SHG analysis. 

In the case of the FT-SHG imaging, the half-wave plate is replaced by a quarter-wave 
plate to generate circularly polarized light in order to ensure SHG emission from collagen 
fibers at all orientations. The gold mirror is replaced by a short-pass 760-nm dichroic beam 
splitter. Tiled backward SHG images are collected to image the whole area (~l-mm diameter) 
of the tissue core. The same set of filters and detector used in forward geometry are utilized 
for the backward geometry. For FT-SHG analysis, the number of cores considered for normal, 
hyperplastic, dysplastic, and malignant tissue are 19, 16, 18, and 29, respectively. 



Detector 



SHG Filter 
Laser Blocking Filter 




66 NA Collection Obj 



0.6 NA Illumination Obj 



Metal mirror 



Polarizer Scan Lens Tube Lens 



Fig. 3. Experimental setup of the SHG microscope with forward and backward collection 
geometries. Details of the setup are provided in the text. 



4. Results and discussion 

4.1. FT-SHG analysis 

A diversity of tissue samples, with different pathologic conditions and from different patients, 
is available using the TMA. Figure 4 shows a selection of tissues in which the brightfield 
(H&E stained) and SHG images of three representative cores from the four major histologic 
classes can be compared. The H&E stained images are the gold standard for clinical 
examination of biopsies. Typically, a manual examination of the H&E images by a 
pathologist allows for the visualization and identification of the major cell types and locations 
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Fig. 4. H&E stained and SHG images of three cores for normal, hyperplastic, dysplastic, and 
malignant breast tissues. Scale bar: 250 um. 



for imaging. While the H&E stained images provide a comprehensive view of tissue structure, 
the dye provides a visualization that is fairly non-specific in terms of histologic structures or 
underlying molecular content. The images shown in Fig. 4 are selected to cover the diversity 
of the samples and are, thus, generally representative of the histology in breast tissue. The 
corresponding SHG images are shown alongside the H&E images in which the green color 
provides a measurable signal and black areas indicate regions without measurable signal. The 
SHG intensity images provide a complementary, molecularly-sensitive visualization of breast 
tissue. As is expected, the SHG images are not sensitive to epithelial cells and these are 
clearly seen as missing from the images. Signal from the stromal regions, however, is large 
and images are of good quality. The correspondence of tissue histology and complementary 
information between the H&E and SHG images is enabled by the clinically-compatible 
protocol developed here. While the combined information can be examined for synergistic 
effects, we focused here on the information content of SHG images first. 

In SHG images, both gross areas of ECM abundance as well as finer structural detail can 
be observed. As a first level of stromal involvement in tumors, one expects to observe less 
ECM in larger tumors as the tissue volume is occupied by epithelial cells. That is, of course, a 
trivial outcome of the histology of solid tumors and not likely to be a very specific diagnostic 
parameter - hence, we do not focus on that aspect here. We have deliberately chosen TMA 
sections that contain some ECM for all histologic subtypes in order to analyze the stromal 
content with SHG. The selection of tissue regions for a valid comparison is not easy as the 
spatial distribution of cells within the field of view and cellularity within the ECM are 
obviously different for different pathologic states. Hence, we first wanted to ensure that the 
samples used in analysis here actually provide a measure of collagen structure and are not 
influenced by other micro structural entities in the tissues. We examined cellularity in the 
ECM in terms of the numbers of "dark" regions within the sample. A region is considered 
dark if the measured SHG intensity is low or negligible, which in the case of breast tissue 
represents an absence of collagen fibers. Figure 5 shows the bar plot of the number of dark 
regions within each tissue condition. The calculated values of number of dark regions are 43.6 
± 37.4, 46.6 ± 27.0, 83.2 ± 51.7, and 75.9 ± 61.4 for normal, hyperplastic, dysplastic, and 
malignant tissue, respectively. Applying a Kolmogorov-Smirnov comparison test, we find that 
this parameter does not provide any statistically significant differences at the 0.05 level. 
Hence, our regions used for comparison are truly similar and the underlying morphology will 
not present a confounding variable. 
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Fig. 5. Bar plot of the number of dark regions for the four tissue conditions. 
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Fig. 6. Bar plot of the parameter, A.I. ratio, for the four pathologic conditions. 

Qualitative differences were observed in the stroma between different pathologies but a 
more quantitative metric is needed for an objective test. Hence, we considered using FT-SHG 
to determine A.I. ratio for the four pathologic conditions as shown in Fig. 6. We find the 
calculated values of the A.I. ratio to be 2.8 ± 1.5, 3.5 ± 2.3, 4.5 ± 3.2, and 11.6 ± 6.7 for 
normal, hyperplastic, dysplastic, and malignant tissue, respectively. It is also observed that the 
A.I. ratio is the highest for malignant tissue indicating an increased regularity in fiber 
orientation compared to other tissue conditions. Indeed, Kolmogorov-Smirnov comparison 
tests showed statistically significant differences in this parameter between the malignant 
tissue and other conditions at the 0.05 level (note that the actual corresponding p-values are 
very close to zero which is indicative of the strength of correlation of this parameter). There is 
also significant variability in the results. This natural variation probably partially accounts for 
the ambiguous conclusions reported in the literature on using SHG to characterize tumors. We 
should also caution that the tumor associated stroma is likely associated only closely with the 
tumor itself. The proximity of the stroma to the tumor in a small TMA sample and our careful 
attention to ensuring that a fair comparison is made by using-well characterized spots likely 
explains the observed trend in our study. Significant care must be taken in transferring this 
result to large areas of unknown tissue. We believe that the observed characteristics of 
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malignant tissue arise from a local stiffening of fibroblasts and the attendant stromal 
remodeling. The spatial extent of this remodeling and its evolution with tumor progression 
should be studied in detail. Here, our goal was simply to develop a tool that can potentially 
examine these effects. Hence, we did not pursue a detailed study of stromal remodeling 
indicative of tumor progression. 

4.2. P-SHG analysis 

While the previous analysis provided an understanding of multiple collagen fibers' 
organization at the microscopic level, we also sought to investigate average changes in 
individual collagen fibers that influence their microscopic organization. Here, we apply P- 
SHG to calculate the normalized tensor elements of the second-order susceptibility matrix 
corresponding to collagen fibers. Figure 7 shows example polarization spectra for various 
conditions. On average, it is observed that collagen fibers in normal tissue (Fig. 7(a)) show 
prominent shoulders at -30° and 150° and is relatively symmetric about 90°. In contrast, 
many fibers in malignant tissue exhibit a polarization spectrum with little or no shoulders and 
an asymmetry about 90° (Fig. 7(d)). Collagen fibers in hyperplastic and dysplastic tissues 
show trends that lie between normal and malignant tissues. These trends are likely a 
consequence of both the remodeling of collagen as well as physical forces in tissue. 

Note that Eq. (6) is a proportional equation and as such we can only obtain normalized 
values of the tensor coefficients. For example, when normalized by d u , we can obtain d 22 /d [5 , 
d 31 /d 15 and d 33 /d Js . Note that, however, the choice of normalization is not arbitrary. We want 
to find the appropriate combination of ratios of tensor elements that contain the least amount 
of redundancy. For example, if d 15 is changing with each tissue condition, then other elements 
normalized by it (d 22 /d 15 , d 31 /d ls and d 33 /d 15 ) will all show the same change, thereby leading to 
redundancy. To minimize this, we consider all combinations of normalization (i.e., d 15 , d 3h 
d 33 , and d 22 ), and the resulting equations are fit to the experimental plot of polarization 
spectrum for all tissue conditions. We find that in the case of breast cancer, both d 31 and d 33 
provide the least redundancy; therefore, we choose to normalize the tensor elements by d 3I . 
The final expression for the SHG intensity as a function of polarization angle is 



I(2a) ■ 



( j j Y ( j \ 



-^sin a + — sin 2a 



d?? 

sin a + — cos a 

d 3l j 



(7) 



where the ratios d 22 /d 31 , d 15 /d 31 and d 33 /d 31 are calculated and used as metrics to differentiate 
between breast tissue conditions. 

After fitting the experimental plots of the polarization spectrum using Eq. (7), the d ratios, 
d\ S ldj,\, d 33 /d 3 i and d 22 /d 3l (absolute value) are calculated and plots are shown in Fig. 8. The 
average value of the ratio d l5 /d 3i ranges from 0.843 for normal to 0.728 for malignant tissue. 
Moreover, the standard deviation increases from a value of 0.064 for normal to 0.122 for 
malignant tissue. Similarly, differences are observed for d 22 /d 3l with a mean value of 0.052 
for normal and 0.116 for malignant tissue. Again, the standard deviation is observed to be 
higher for abnormal conditions, likely indicating greater heterogeneity. Interestingly, the ratio 
d-iild^x remains relatively constant across all conditions. These ratios can potentially function 
as metrics of pathology. There is, however, significant variance indicating the response and 
heterogeneity of stromal remodeling. 

In order to test if the d ratios can be used as a metric for differentiating various breast 
pathologies, we performed statistical comparison tests assuming the values within the core to 
be dependent and the values across the cores to be independent. Repeated measures ANOVA 
with between-subject factors was performed using OriginPro 8 and SAS. A condition for this 
analysis is that the distribution of values of d ratios must be normal. This is satisfied for the 
ratios dyd^ and d 22 /d 3l , but not with d^ld-n. However, for this case, we find that the log of 
d 33 /d 3 ] also shows a normal distribution thereby permitting us to apply the above analysis. The 
results of the comparison are shown in Table 1. Using d l5 /d 3 i and a p-value of 0.05, we find 
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that there seems to be the potential to differentiate between normal and dysplasia, normal and 
malignant, hyperplasia and malignant, and dysplasia and malignant. However, differentiation 
of hyperplasia from either normal or dysplasia could not be established. Interestingly, we did 
not observe any statistical differences between ductal and lobular carcinoma-associated 
stroma. 

Given the variance and the range in values, at this stage, it does not appear that there is a 
simple feature in SHG data that characterizes breast pathology. To more accurately estimate 
the use of SHG imaging data for breast pathology, we sought to devise a test based on 
similarity to normal tissue. We defined a filter that contains the range of values of d^d^x for 
normal tissue and we label the collagen fibers whose tensor values fall outside this normal 
range as "abnormal" collagen fibers. We then tabulated the number of abnormal collagen 
fibers in each tissue condition. The fraction of abnormal fibers defined in this manner is 
plotted for each condition in Fig. 9 and were observed to be 0.0% (by definition), 10.4%, 
19.1%, and 47.8% for normal, hyperplastic, dysplastic, and malignant tissue, respectively. It 
is evident that the number of abnormal fibers is distinctly different for different pathologies. It 
is also interesting to observe that about 52.2% of the collagen fibers in malignant tissue are 
normal fibers with this definition. This leads to the conclusion that there are few "global" 
changes in tissue stroma but different regions must be carefully probed. The second important 
conclusion is that imaging is needed and averaging measurements will likely yield to 
confusing results. Thus, studies designed to measure changes due to cancer progression will 
have to carefully segment the spatial variations and understand these in the context of the 
tumor. While our goal here was to determine whether there is any potential of using SHG 
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Fig. 7. Normalized SHG intensity as a function of polarization angle (or polarization spectrum) 
for a) normal, b) hyperplastic, c) dysplastic, and d) malignant breast tissue. 
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imaging to understand stromal changes in breast cancer, for future clinical translation, this 
issue of within sample and other variance must be examined through a larger study and 
location of fibers relative to the tumor. 

As observed from the measurements above, normal breast tissue contains collagen fibers 
whose dis/d-n value lies in the range of 0.843 ± 0.064, and d 33 /d 3l within the range 1.570 ± 
0.074. These values are close to reported values for type I collagen [24]. The values of ^22/^31 
for normal tissue lie close to zero (0.052 ± 0.043) indicating that the assumption of cylindrical 
symmetry is valid for normal tissue. This symmetry condition, however, does not seem to 
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Fig. 8. Bar plot of the tensor elements (d\ 5 /dn, dy)ld%i, ^22/^31) for normal, hyperplastic, 
dysplastic, and malignant tissues. 



Table 1. Comparison Tests between Breast Tissue Conditions" 





d,s/d }1 (ANOVA) 




d 33 /d }l (ANOVA) 




dnldi 


(K.S. Test) 




Groups 


M.D. 


p-value 


S.S. 


M.D. 


p-value 


S.S. 


M.D. 


p-value 


S.S. 


Normal vs. 
Hyperplasia 


0.023 


0.3259 


No 


-0.005 


0.8085 


No 


-0.011 


0.9845 


No 


Normal vs. 
Dysplasia 


0.059 


0.0170 


Yes 


0.003 


0.9122 


No 


-0.012 


0.1688 


No 


Normal vs. 
Malignant 


0.115 


O.0001 


Yes 


-0.040 


0.0438 


Yes 


-0.064 


0.0002 


Yes 


Hyperplasia 
vs. Dysplasia 


0.036 


0.1410 


No 


0.008 


0.7304 


No 


-0.001 


0.1641 


No 


Hyperplasia 
vs. Malignant 


0.092 


0.0003 


Yes 


-0.034 


0.0762 


No 


-0.053 


0.0002 


Yes 


Dysplasia vs. 


0.056 


0.0218 


Yes 


-0.043 


0.0395 


Yes 


-0.052 


0.0135 


Yes 



Malignant 

"The comparison test is performed at a p-value of 0.05, which corresponds to 95% confidence level. M.D. is the mean 
difference between the different tissue conditions. S.S. is the abbreviation for statistically significant. 
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Fig. 9. Percentage of abnormal collagen fibers as a function of breast pathology. 

hold for malignant tissues that contain abnormal fibers. Since malignant tissues also contain 
normal tissue-like fibers (-50% of the total fibers), the average differentiation appears to be 
smaller. This is also a likely reason for the large standard deviation associated with malignant 
tissues. Hence, we removed the contribution of normal fibers from malignant tissue and 
considered only abnormal fibers, which provided values of d^dn = 0.646 ± 0.122, d^ld^i = 
1.603 ± 0.123, and d 11 ld- i \ = 0.117 ± 0.087. Malignant tissue contains fibers that have 
comparatively smaller d l5 /dn and larger ^22/^31 values. Larger values of ^22/^31 manifest as 
asymmetry about 90°, and are an indication that trigonal symmetry is a more appropriate 
description for abnormal fibers. Our observed changes in the values of the d ratios and 
symmetry are interesting, likely indicating ECM remodeling that is observed during tumor 
progressions [45]. Molecular changes in collagen fibers were also reported in recent 
biochemical studies [46,47]: collagen present in the normal tissue is a heterotrimer consisting 
of two a 1(1) chains and one oc2(I) chain, while collagen found in malignant tissues are 
homotrimeric consisting of three a 1(1) chains. Unfortunately, resolving these changes are well 
below the diffraction-limit of conventional optical microscopes but P-SHG allows us to detect 
potential molecular-level changes optically. Matrix metalloproteinases (MMPs) are also 
upregulated in cancer [48] and are known to degrade type I collagen fibers. Furthermore, 
MMP upregulation also leads to a changed isoform of type 1 collagen, which they do not 
degrade and helps impart a more aggressive phenotype to cancer cells. Hence, the tools 
reported in this study can lead to several other basic studies on the mechanism of cancer 
progression. 

5. Conclusion 

We utilized two quantitative second-harmonic generation techniques, FT-SHG and P-SHG, to 
investigate differences in stromal collagen fibers for human breast tissues from different 
pathologic conditions, namely normal, hyperplasia, dysplasia, and malignant. Using FT-SHG, 
we quantified collagen fiber organization at cellular scales. We noted the observed differences 
between various pathologies and developed the metric A.I. ratio as a potentially useful one to 
differentiate malignant from other tissue conditions. Alternatively, P-SHG was used to infer 
structural changes in collagen at the molecular scale through the calculation of the normalized 
tensor elements of the second-order susceptibility. In particular, di 5 ld 3 i showed statistical 
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differences across breast tissue pathologies, except between hyperplasia and normal/dysplasia. 
We also found that trigonal symmetry (3m) is a more appropriate model to describe collagen 
libers in malignant tissues as opposed to the conventionally used hexagonal symmetry (C6). 
Furthermore, the percentage of abnormal collagen fibers could potentially be used as a metric 
for differentiating breast tissue conditions. This unique method of targeting collagen fibers 
using a combination of two quantitative SHG techniques, FT-SHG and P-SHG, seems 
promising for further study, especially as a complementary technique with gold-standard 
histopathological diagnosis. To that end, our SHG measurements were conducted on clinically 
compatible slides and can be easily incorporated within present clinical work. Future work 
includes application of quantitative SHG imaging for extension to in vitro breast cancer 
studies as well as other types of cancers. 
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